Transfer Learning by Structural Analogy
نویسندگان
چکیده
Transfer learning allows knowledge to be extracted from auxiliary domains and be used to enhance learning in a target domain. For transfer learning to be successful, it is critical to find the similarity between auxiliary and target domains, even when such mappings are not obvious. In this paper, we present a novel algorithm for finding the structural similarity between two domains, to enable transfer learning at a structured knowledge level. In particular, we address the problem of how to learn a non-trivial structural similarity mapping between two different domains when they are completely different on the representation level. This problem is challenging because we cannot directly compare features across domains. Our algorithm extracts the structural features within each domain and then maps the features into the Reproducing Kernel Hilbert Space (RKHS), such that the “structural dependencies” of features across domains can be estimated by kernel matrices of the features within each domain. By treating the analogues from both domains as equivalent, we can transfer knowledge to achieve a better understanding of the domains and improved performance for learning. We validate our approach on a large number of transfer learning scenarios constructed from a real world dataset. Introduction and Motivation Re-using knowledge across different learning tasks (domains) has long been addressed in the machine learning literature (Thrun 1998; Caruana 1997; Daumé III 2006; Dai 2008; Blitzer 2006). Existing research on this issue usually assume that the tasks are related on the low level, i.e. they share the same feature space or the same parametric family of models, such that knowledge transfer can be achieved by re-using weighted samples across tasks, finding a shared intermediate representation, or learning constraints (informative priors) on the model parameters. However, examining knowledge transfer in human intelligence, we could find that human beings do not rely on such low-level relatedness to transfer knowledge across domains. Namely, we human beings are able to make analogy Copyright c © 2011, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. across different domains by resolving the high level (structural) similarities even when the learning tasks (domains) are seemingly irrelevant. For example, we can easily understand the analogy between debugging for computer viruses and diagnosing human diseases. Even though the computer viruses (harmful codes) themselves have nothing in common with bacteria or germs, and the computer systems is totally different from our bodies, we can still make the analogy base on the following structural similarities: 1. Computer viruses cause malfunction of computers. Diseases cause disfunction of the human body. 2. Computer viruses spread among computers through the networks. Infectious diseases spread among people through various interactions. 3. System updates help computers avoid certain viruses. Vaccines help human beings avoid certain diseases. Understanding of these structural similarities helps us abstract away the details specific to the domains, and build a mapping between the abstractions (see Figure 2). The mapping builds on the high level structural relatedness of the two domains, instead of their low level “literal similarities”. In other words, the attributes of the “computer” and the “human” themselves do not matter to the mapping, whereas their relationships to other entities in their own domains matter. This is reminiscent of the learning-by-analogy paradigm in early endeavors in intelligent planing and problem solving. However, many previous operational systems in computational analogy, such as case-based reasoning, have used a simple similarity function between an old and new problem domain, whereby the features in the two domains are identical, albeit weighted. This similarity measure cannot handle some more intuitive cases of human problem solving, such as the above example, in which the similarity between the domains should be measured on the structural level. And such a “structural similarity” can only be determined if we can correctly identify analogues across completely different representation spaces. On the other hand, in cognitive science, analogical learning indeed involves developing a set of mappings between features from different domains. Such a need is captured in structure mapping theory (Falkenhainer 1989; Gentner 1990) of analogical reasoning, which argued for deep reComputer Computer Infect Computer Viruses Prevent
منابع مشابه
Analogical Reinforcement Learning
Research in analogical reasoning suggests that higher-order cognitive functions such as abstract reasoning, far transfer, and creativity are founded on recognizing structural similarities among relational systems. Here we integrate theories of analogy with the computational framework of reinforcement learning (RL). We propose a computational synergy between analogy and RL, in which analogical c...
متن کاملAnalogy and Transfer: Encoding the Problem at the Right Level of Abstraction
Analogy and transfer studies belong to distinct research areas regarding both the theoretical frameworks and the experimental paradigms, although they both contribute to the fundamental question of the generalization of learning. The aim of this paper is to show that those fields might be reanalyzed within a common framework, emphasizing the role of the abstraction level at which the problem is...
متن کاملTesting the Structural Model of Job Characteristics, Organizational Climate and Extra-Organizational Factors on the Transfer of Education with the Role Mediation of Strategies Transfer
The purpose of this study was to investigate the role of job factors, constructive organizational climate and extra-organizational factors on the transfer of learning with the mediating role of learning transfer mechanisms on the consequences of learning. The research method was descriptive-survey and based on structural equations. The statistical population of the study included all managers, ...
متن کاملAnalogical Processes in Human Thinking and Learning
Much of humankind's remarkable mental aptitude can be attributed to analogical abil ity the abil ity to perceive and use relational similarity. In this chapter, we present an overview of analogy and describe its component processes, including structural alignment and inference projection, evaluation, schema abstraction and re-representation. We discuss how these component processes lead to lear...
متن کاملTesting the Structural Model of Job Characteristics, Organizational Climate and Extra-Organizational Factors on the Transfer of Education with the Role Mediation of Strategies Transfer
The purpose of this study was to investigate the role of job factors, constructive organizational climate and extra-organizational factors on the transfer of learning with the mediating role of learning transfer mechanisms on the consequences of learning. The research method was descriptive-survey and based on structural equations. The statistical population of the study included all managers, ...
متن کامل